VUDENC: Vulnerability Detection with Deep Learning on a Natural Codebase for Python
نویسندگان
چکیده
Identifying potential vulnerable code is important to improve the security of our software systems. However, manual detection vulnerabilities requires expert knowledge and time-consuming, must be supported by automated techniques. Such vulnerability techniques should achieve a high accuracy, point developers directly fragments, scale real-world software, generalize across boundaries specific project, require no or only moderate setup configuration effort. In this article, we present Vudenc (Vulnerability Detection with Deep Learning on Natural Codebase), deep learning-based tool that automatically learns features from large Python codebase. applies word2vec model identify semantically similar tokens provide vector representation. A network long-short-term memory cells (LSTM) then used classify token sequences at fine-grained level, highlight areas in source are likely contain vulnerabilities, confidence levels for its predictions. To evaluate Vudenc, 1,009 vulnerability-fixing commits different GitHub repositories seven types (SQL injection, XSS, Command XSRF, Remote execution, Path disclosure, Open redirect) training. experimental evaluation, achieves recall 78%–87%, precision 82%–96%, an F1 score 80%–90%. Vudenc’s code, datasets corpus available reproduction. Our results suggest capable outperforming most competitors terms vulnerably capabilities software. Comparable accuracy was achieved synthetic benchmarks, within single projects, much coarser level granularity such as entire files.
منابع مشابه
Melanoma detection with a deep learning model
Background: Skin cancer is one of the most common forms of cancer in the world and melanoma is the deadliest type of skin cancer. Both melanoma and melanocytic nevi begin in melanocytes (cells that produce melanin). However, melanocytic nevi are benign whereas melanoma is malignant. This work proposes a deep learning model for classification of these two lesions. Methods: In this analytic s...
متن کاملVulDeePecker: A Deep Learning-Based System for Vulnerability Detection
The automatic detection of software vulnerabilities is an important research problem. However, existing solutions to this problem rely on human experts to define features and often miss many vulnerabilities (i.e., incurring high false negative rate). In this paper, we initiate the study of using deep learning-based vulnerability detection to relieve human experts from the tedious and subjective...
متن کاملVulnerability of Deep Learning
The Renormalisation Group (RG) provides a framework in which it is possible to assess whether a deep-learning network is sensitive to small changes in the input data and hence prone to error, or susceptible to adversarial attack. Distinct classification outputs are associated with different RG fixed points and sensitivity to small changes in the input data is due to the presence of relevant ope...
متن کاملA Framework for Distributed Deep Learning Layer Design in Python
In this paper, a framework for testing Deep Neural Network (DNN) design in Python is presented. First, big data, machine learning (ML), and Artificial Neural Networks (ANNs) are discussed to familiarize the reader with the importance of such a system. Next, the benefits and detriments of implementing such a system in Python are presented. Lastly, the specifics of the system are explained, and s...
متن کاملAutomated software vulnerability detection with machine learning
Thousands of security vulnerabilities are discovered in production software each year, either reported publicly to the Common Vulnerabilities and Exposures database or discovered internally in proprietary code. Vulnerabilities often manifest themselves in subtle ways that are not obvious to code reviewers or the developers themselves. With the wealth of open source code available for analysis, ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Information & Software Technology
سال: 2022
ISSN: ['0950-5849', '1873-6025']
DOI: https://doi.org/10.1016/j.infsof.2021.106809